A STABILITY VERSION OF HOLDER'S INEQUALITY 



J. M. ALDAZ 

Abstract. We present a stability version of Holder's inequality, incorporating an extra term 
that measures the deviation from equality. Applications are given. 



1. Introduction. 

In the field of geometric inequalities, the expression Bonnesen type is used after Bonnesen 
classical refinement of the isoperimetric inequality (cf., for instance, |Oslj . }Os2j ). where the 
deviation from the case of equality (the disk) is given in terms of the outer radius and the 
inradius of a bounded convex body. The term stability type inequality is also used in a related 
way (cf. [Grp . meaning that if the deviation from equality is "small", then the objects under 
consideration must be "close" to the extremal object. 

Here we explore the question of what a Bonnesen or stability version of Holder's inequality 
should look like, as we move away from the equality case. Since the functions / and g involved 
in Holder's inequality will usually belong to different spaces, before they can be compared we 
need to map these functions, with controlled distortion, into a "common measuring ground" . 
The way we choose to do this is by first normalizing, and then applying the Mazur map from 
LP and L q to L 2 . For nonnegative functions in the unit sphere of LP the Mazur map into L 2 
is defined by / i— > f p l 2 . We will be able to utilize its well known properties (cf. for instance, 
|BeLij ) to obtain useful estimates. 

As a model for the stability version of Holder's inequality, we use the (real) Hilbert space 
parallelogram identity, suitably rearranged under the assumption that the vectors are nonzero 
(see (I2.0.2p below). With (12.0.21) in mind we obtain a natural, straightforward generalization 
of the parallelogram identity, valid for 1 < p < oo, though when p ^ 2 equality will of course 
be lost, cf. (12.2. ip . After one has decided which inequality to prove, the argument is standard. 
In fact, it is the standard argument: From a refined Young's inequality one obtains a refined 
Holder inequality, which in turn entails a refined triangle inequality, which (together with a 
simple additional observation) yields the uniform convexity of L p spaces in the real valued 
case, with optimal power type estimates for the modulus of convexity. 

Like the parallelogram identity in the Hilbert space setting, (12.2. ip brings to the fore the 
geometry of L p spaces, and conveys essentially the same information: In order for ||/<7||i to 
be close to the angle between the nonnegative L 2 functions \f\ p ^ 2 and \g\ q ^ 2 must 

be small, with equality in ||/<7||i < H/HplMlg precisely when the angle is zero. Since Holder's 
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inequality is one of the most often used inequalities, the refinement given here is likely to 
have repercussions far beyond the few applications presented below. 

The paper is organized as follows. Section 2 contains the basic inequality and its proof, 
together with the precedents I have been able to find, and a small discussion as to why some 
plausible improvements of (12.2.11) cannot hold. Section 3 establishes a few direct consequences 
regarding bounds on interpolated norms. Specializing the previous remark about angles to 
the function 1 on a probability space, we obtain a stability version of the following standard 
application of Holder's inequality: If < r < s, then every / G L s satisfies ||/|| r < ||/|| s , 
with equality if and only if |/| is constant. As we noted, the norms ||/|| a and ||/|| r will be 
close if and only if the angle between 1 and |/| S//2 is small (cf. Theorem 13. ip . Expressing this 
result in terms of the variance of |/| s/2 , we shall see that ||/|| s and ||/|| r are close if and only 
if the normalized variance Var (|/| s/2 /|||/| s/2 || 2 ) is sufficiently small, cf. Corollary [321 These 
results provide qualitative information about the behavior of LP norms, which apparently had 
not been noticed before. Finally, Section 4 contains a sharpened triangle inequality, leading 
to the proof of uniform convexity announced above. 

We work on an arbitrary measure space (X, A, p), whose mention will usually be omitted; 
to avoid trivialities we assume that \x is not identically zero, and (when dealing with uniform 
convexity) that X contains at least two points. 



2. The basic inequality. 



In this paper p and q always denote conjugate exponents, i. e., q = p/(p — 1), and unless 
otherwise stated, it is understood that / G L p , g G L q and neither function is zero almost 
everywhere. To motivate the variant of Holder's inequality given below, let us consider first 
the situation in a real Hilbert space setting. From the parallelogram identity 

(2.0.1) \\x 

we get, after expanding ||x+|/|| 2 , replacing x by tx, taking t 
the equality 

2 N 



2||a;|| 2 + 2||y|| 2 



x 



and factoring \\y\ 



(2.0.2) 



(x,y) = \\x\\\\y\\ 1 



x 



y 



X 



\y\ 



valid for nonzero x and y. We follow this line of thought in the L p setting, using (12.0.21) as 
a model. Observe that the identity (12.0.2P can be regarded as a stability version (and also a 
proof) of the Cauchy-Schwarz inequality. 

The first step is to refine Young's inequality u p /p + v q /q — uv > 0. 

Lemma 2.1. Let 1 < p < 2 and let q be its conjugate exponent. Then for all u, v > 



(2.1.1) 



- (u p ' 2 
q 



,9/2 



)■ 



U V H 
— + - 

p q p 

Proof. If p = 2 = q the result is trivial, so assume 1 < p < 2. We prove the first inequality; 
the second can be obtained via an essentially identical argument, by interchanging the roles 
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of p and q, and of u and v. If either u = or v = 0, formula (12.1.11) is obviously true. Fix p, 
fix u > 0, and suppose t> > 0. Expanding the square and simplifying, we see that it is enough 
to check the following inequality: 

2 



(2.1.2) 



f(v) 



2-p 
p 



-u p + -u p/ V /2 - uv > 0. 



Now v = u p is the unique solution of f'{v) = 0. Since f" > 0, f(u 
minimum of /. 



P -i\ 



is the global 

□ 



An extension of (12.0. 2D to the case 1 < p < oo follows now by repeating the steps in the 
usual derivation of Holder's inequality from Young's inequality. Only minimal modifications 
to the Hilbert space argument given above are needed, though of course, the equality becomes 
a two sided inequality when p ^ 2. We write t + := max{t, 0} for the positive part of a real 
number or a real valued function, and t r + := (max{t, 0}) r , so the maximum is taken first. The 

left hand side of the identity ||/||£ /2 = |||/| p/2 || 2 seems to be typographically more convenient 
and easier to read than the right hand side, so we will use it below. However, it makes it less 
obvious that in (12XTD the functions \ f\ p/2 /\\f\\ P P /2 and \g\ q/2 /\\g\\q 2 are simply norm 1 vectors 
in L 2 (so we are in fact dealing with the angle between \f\ p ^ 2 and l^l 9 ^ 2 , cf. Remark 12. 3p . 



Theorem 2.2. Let 1 < p < oo and let q = p/(p 
g G L q , ll/Hp, \\g\\ q > 0, andl<p<2, then 
(2.2.1) 



1) be its conjugate exponent. If f e L p , 



\9\ 



V 



m p/2 



\9 



q/2 



p/2 
V 



\a\\ ql2 

\g \\q 



< ll/slh < 



1-1 

Q 



\f\ p/2 



1 9/2 



p/2 
P 



I 119/2 

\9M 



while if 2 < p < oo, the terms 1/p and 1/q exchange their positions in the preceding inequal- 
ities. 



Proof. Suppose 1 < p < 2. Write u = \f(x)\ and v = \g(x)\ in (I2.1.ip . integrate, substitute 

tf for /, and set t = \\g\\q 1 Vll/llp- Now ( 12.2. lft immediately follows. If 2 < p < oo, just 
interchange the roles of p and q. □ 



Of course, when p = 2 the inequality ( 12.2. ip follows from ( 12.0.21) . and in fact, it is identical 
to it, save for the fact that only nonnegative functions appear in (12.2.11) . 

The reason why we take the positive part in the left hand side of (12.2.11) . is that in some 
inequalities given below we will need to take powers of the corresponding quantities. 

Remark 2.3. Recall that in a real inner product space, the angle Z(x,y) between x and y 
is defined by 



Z(x,y) :-- 



arccos 



jx,y) 
MUM 



arccos 
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where the second equality follows from (12.0. 2ft . Actually, the simpler expression 8(x,y) := 

Pll — , giving the length of the segment between and y/||y||, is often taken as 

;he definition of angle in a general Banach space (cf., for instance, pg. 403 of (Clj). In 
the real Hilbert space setting, Z(x,y) and 9(x,y) are clearly comparable quantities (in fact, 
0(x,y) < Z(x,y) < (n/2)9(x,y)) so up to a constant it does not matter which one is used. 
Thus, the geometric content of (12.2. ip is clear: ||/g||i ~ ||/||p||fli| 9 if and only if the angle 

A\f\ p/2 , \a\ q/2 ) is smalL Note also that the 

same term 9 2 (\f\ p ^ 2 , \g\ q ^ 2 ) appears both on the 
left and on the right hand sides of (12.2. ip : hence, the exponent 2 cannot be improved. This 
helps to explain why from (12.2.11) we obtain optimal asymptotic power type estimates for the 
modulus of convexity of L P (X, K) spaces. 

Observe that if / and g have disjoint supports then (12.2.11) becomes 

(2-3.1) \\fU\g\U (i--) < \\fg\\i = o < ||/ Ubi| 9 (l-- 



P/+ V 1 

Hence, the right hand side bound worsens as p — > 1 (and q — > oo). Note also that the constant 
1/2 appears, instead of 1/p and 1/q, both in (12.0.21) above and in (12.4.41) below. Thus, it is 
natural to ask whether it is possible to improve at least one of the factors 1/p, 1/q in (12.2.11) . 
replacing it by 1/2 (of course, when supports are disjoint we cannot do better than writing 
on the left hand side, but under less than full orthogonality, the change from 1/p to 1/2 
might be useful). Next we show that such change is not possible. 

Example 2.4. Let 1 < p < 2. Replacing 1/q by 1/2 in the right hand side of (12.2.11) and 
simplifying we find that this modification of the second inequality is equivalent to 

(2-4.1) / \fg\ < \\f\t p/2 \\g\\ l - q/2 [ \f\ p/2 \g\ q/2 . 



Likewise, replacing 1/p by 1/2 in the left hand side of (I2.2.ip leads to 

(2.4.2) J \f 9 \ > ii/h^imi;-^ 2 I m p/2 M ?/2 . 

It is easy to find examples showing that neither (12.4. ip nor (12.4.21) hold. Take for instance 
/ = 1 on [0, 1] and g = 2 X[0)1/2] . Then \\g\\ q = 2^, so 1 = / fg < ||s||^ /2 / \g\i/ 2 = 2 1 / 2 -^ 
and thus (12.4.21) fails. Choosing now g = 1 and / = 2^ [0,1/2] we have ||/|| p = 2 1 ~ 1 / p and 

1 > II/IIp~ p/2 / \f\ p/2 = 2 1 ' 2 - 1 /p so (12X11 does not hold either. 

A more indirect argument shows that in fact 1/q cannot be replaced by any fixed constant 
c G (0, 1/2) (independent of p, or equivalently, of q). Since (12.2.11) can be used to prove the 
uniform convexity of IP for p > 1, if there were such a c, then the upper bound in (I2.2.ip 
would not degenerate as p j 1, and we would be able to show that the modulus of convexity 
of L p is independent of p for every p e (1, 2], an obviously false result. 

Despite its obvious interest, not much work has been done, as far as I know, regarding 
stability versions of Holder's inequality. I am aware of two previous articles giving bounds 
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for the deviation from the case of equality. In [DraGoj the following result is presented: 



(2.4.3) 



< 1 



(\f\,\9\) 
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p \\a\\ q 
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where (/, p) := / /p. Note that (12.4.31) does not coincide with the rearranged parallelogram 
identity (12.0.21) when p = q = 2. 

An inequality more closely related to (I2.2.ip . which for nonnegative functions does extend 
(12.0.21) . appears in [PeSij . The argument is actually the same as the one used here (and in 
the standard proof of Holder's inequality), save for the fact that the initial refinement of 
Young's inequality is different from (12.1.11) . Suppose /, g > 0. By Theorem 2 of [PeSij, if 
1 < q < 2 < p < oo, then 
(2.4.4) 



f\\9 



\i/p 
W 



\q/p 
\q 



< 



Wf9\\: 



1 

< - 

- 2 



p/q 
p 



p/q\ 
p 



In addition to the factor 1/2 mentioned before, there are other differences between (I2.4.4P 
and (I2.2.ip . Note, for instance, that every term in (12.2.11) is finite, while for p > 2, whenever 
the support of g is not contained in the support of / the right hand side of (12.4.41) blows up. 

After submmiting this paper I have come accross the article |GGS] . where a refinement of 
Holder's inequality is obtained by using the positive definiteness of the Gram matrix. Write 
min{p _1 , q^ 1 }. Under the usual hypotheses, Theorem 2.3 of [GGS] states that 



m 



(2.4.5) 



(/,<?)< 



\9\\q(^-r) 



where r is an explicitly defined function of f p ^ 2 ,g g ^ 2 and a third normalized vector h E L 2 . 
Both inequalities (I2.4.5P and (12.2. ip have in common the use of L 2 to bound the deviation 
from equality. As differences, we note that (12.4.51) is one sided, and it does not reduce to the 
rearranged parallelogram identity when p = q = 2. 

Another relevant reference was found too late to include it in the accepted version of the 
manuscript, cf. [SiJ. The one sided refinement given there is less related to (I2.2.ip than those 
from |DraGoj . [PeSi] . and [GGS] . 

Remark 2.5. It is easy to give a stability version of the following standard variant of Holder's 



inequality: If r > 0, p + q 



f E LP, and g G L«, then ||/^|| r < 



\g\\q- From it 



and an induction argument, stability versions for multiple products can be obtained, that is, 
for the inequality ||n^ =1 /i|| r < n^ =1 ||/i|| Pi , where f { e L n and Y?i=\V~i X = 
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3. Interpolation- type consequences. 
In this section we derive some immediate interpolation-type results. Note that 



(3.0.1) 



\f\ p/2 



\9 



9/2 



p/2 
P 



\a\\ q/2 

\y\\i 



f\f\ p/2 \g\ q/2 



u\m 1/2 urn 



1/2 



and these quantities are strictly positive when q > 2. In what follows, both expressions will 
be used. 

Recall that on a probability space, if < r < s, then every / e L s satisfies \\f\\ r < ll/IU 
a fact that follows either from Jensen's inequality, or by writing |/| as the product |/| ■ 1 
and then applying Holder's inequality. From the equality case in either Jensen or Holder 
inequalities, we have ||/|| r < ||/|| s unless |/| is constant. This suggests that the deviation of 
|/| (or more precisely, of its normalized image under the Mazur map) from its mean value 
can be used to obtain finer bounds. 



Theorem 3.1. Let < r < s < oo, and let f e L s satisfy 
(3.1.1) 



> 0. If s < 2r, then 



2r 
1 

s 



\\f\ 



s/2| 



ll/| S/2 



l/r 



< 



, < 



1 



2(8 



\\f\ 



s/2| 



im s/2 



l/r 



while if s > 2r, the inequalities hold with 2r/s and 2(s — r)/s interchanged. 

Proof. We use Theorem (12.21) with p = s/r > 1, |/| r G L p , q = s/(s — r) > 1 and g = 1. 
Suppose first that s < 2r, i.e., that 1 < p < 2. Substituting in (12.2. ip and simplifying we get 
(13.1.11) . If 2 < p < oo argue in the same way and use the last part of Theorem 12.21 □ 

A more common measure of the dispersion of |/| S//2 /|| l/l^lh around its mean is the vari- 
ance Var. From the previous result it is possible to derive bounds for \\f\\ r in terms of 



Var(|/|*/ 2 /||| 



\s/2\ 



0- 



Corollary 3.2. Let < r < s < oo, and suppose < 



(3.2.1) 



2r 

1 Var 

s 



s/2 



H/| S/2 || 2 



l/r 



< 



< 



< oo. If s < 2r, then 
\f\ s/2 



Var 



Il/I s/2 || 2 



l/r 



while if s > 2r, the same inequalities hold, but with the terms 2r/s and (s — r)/s interchanged. 
Proof. Note that for all x G [0, 1] 

(3.2.2) 2^(1 - x 2 ) = 2-\l + x)(l - x) < 1 - x < 1 - x 2 . 

Next we set x = || |/| s ^ 2 ||i/|| |/| s ^ 2 ||2- Then x < 1 by either Jensen's inequality or more simply, 
the nonegativity of the variance. Substituting in (13.2.21) we obtain 



I f\ 



s/2 



II f I 



s/2 1 



II f I 



s/2 1 



I f\ 



s/2 
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Now (13.2.11) follows from (13.1.11) when s/r < 2, while if if 2 < s/r, we use the last part of 
Theorem 13.11 to obtain the corresponding inequalities. □ 

Theorem 13.11 and Corollary 13.21 are stability results, in the sense that ||/|| s and ||/|| r are 
"close" if and only if |/| is "nearly" constant; when Z(|/| s / 2 , 1) (or Var (|/| s/2 /|| |/| s/2 || 2 )) 
is sufficiently small, these norms are comparable. We believe these results will be useful in 
contexts where information is available about the first and second moments of a function, as 
is often the case in Probability Theory. 

Remark 3.3. It is easy to check that the factors between square brackets in the left hand 
sides of (13.1.11) and (13.2. lj) can actually be negative, so the positive part must be taken 
before raising them to the 1/r power. Take for instance, s = 2, any fixed r G (1,2), and 
/ = \/nx[o,i/n] on [0, 1], with n = n(r) "large enough". 

A variant of the result on containment of LP spaces exchanges the probability measure (or 
more generally, finite measure) hypothesis by the condition that / belongs to L Po , for some 
Po < p. We consider this next. 

Theorem 3.4. Let < po < p < p\ < oo, and let t = t(p) be given by the equation 
p- 1 = (1 - t)p 1 + tp\ x . Suppose f G L Po PI L Pl and / # 0. Lfp /p 1 < t~ l - I, then 



(3.4.1) WfWWSWl 



(3.4.2) <||/|| P <||/0/||* 

while if Po/ Pi > t~ x — 1, the inequalities are reversed, and the positive part of the term between 
square brackets is taken in the right hand side of {3.^.2^ . 

Proof. Again we use Theorem (Q, with the functions fO--t)pfp = fp } an d the conjugate 
exponents Po/[(l — t)p] and pi/tp. Note that po/[(l — t)p] > 1 and pi/tp > 1, while po/pi < 
r 1 - 1 if and only if p /((l - t)p) < 2. □ 

Remark 3.5. The preceding theorem leads to a midpoint interpolation result for arbitrary 
pairs of functions. Suppose, for instance, that f,h G L Po R L Pl , f,h^t 0, ||/|| po < ||^||p , 
and \\f\\ Pl < H^llpi- It is easy to see that ||/|| p > \\h\\ p may happen for some intermediate 
p G (pojPi). Consider the following example: Set f(x) = (1 — l/n)x[o,i/2] on [0,1], where 
n > 6 is fixed, and let h(x) = x. Then ||/||i < \\h\\i and ||/||oo < ll^lloo, but ||/|| n > \\h\\ n . 
Note that ||/|| p < \\h\\ p for every large enough p < oo; in particular, if n = 6 we can 
take pi = 11, so there is a reversal of the inequality at p — (po + pi)/2. However, under 
the additional condition on the angles Z(|/i| p °/ 2 , \h\ Pl/2 ) < Z(\f\ p °/ 2 , \f\ Pl/2 ), or equivalently, 
6(\h\ p °/ 2 , \h\ P1 / 2 ) < 6(\f\ p °/ 2 , |/| pi/2 ), at the midpoint p = (p + Pl )/2 we have ||/|| p < \\h\\ p 



2(l-t) Pl 

^1 -t)p! +tp 



I\f\ 

M 2 



PQ+gl 



(f\fW (/l/H 



1/2 



2tp 



(l-f)pi+to V (J\f\ p o) 1/2 (J\f\ p A 



1/2 
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whenever ||/|| Po < \\h\\ Po and \\f\\ Pl < To see this, note that if p = (po +Pi)/2, then 

^ = Pi/ (Po + Pi); so from (13.4.21) and (13.4.11) we get 

21 Vp 



< 



< 



X — * I 
po I 



h 



l-t 

PO 



I PI 



1 - 



j/lPo/2 



Pi/2 



ll/h /2 l 



i 



l/jlPo/2 



\ h \Pl/2 



\hW 2 \ 



l/ilpl/ 2 l 



1/p 



< 



Needless to say, stronger assumptions on the angles lead to stronger interpolation results. 
For instance, if 6(\h\ p °/ 2 , \h\P^ 2 ) < #(|/| Po/2 , |/| Pl/2 ), then ||/|| p < \\h\\ p for every p in some 
neighborhood of (po +p\)/2, since the quantities involved in ( 13.4.21) and (13.4. ip change con- 
tinuously. It is also possible to consider conditions of the type \\f\\ Pi < Ci\\h\\ Pi , with Cj > 
not necessarily equal to 1, or even to have h G L r ° fl L n with r, ^ pi, as is often done in 
interpolation theorems. But we will not pursue these elaborations here. 

Remark 3.6. In standard interpolation results, such as the Riesz-Thorin and the Marcinkie- 
wicz interpolation theorems, the pairing between the functions / and h = T(f) is not arbitrary 
but given respectively by a linear or sublinear operator T, and the conclusion, of course, is 
much stronger than anything contained in the previous remark. The attentive reader may 
wonder why more general pairings are interesting, or in other words, whether there is any need 
to go beyond sublinearity. Next we give an example where such a result might be useful. It 
involves the derivative DMf of the one dimensional, uncentered Hardy-Littlewood maximal 
function Mf, defined as follows: Given a locally integrable function / : R — > R, 



Mf(x) :=sup^ f \f(y)\dy, 
xel M I J i 



where / is any interval containing x and \I\ stands for its length. Starting with the paper 
|Kij . there has been in recent years a growing interest regarding the regularity of the maximal 
function (cf., for instance, [AIPe] and the references contained therein). Suppose for simplicity 
that / : R — > R is a compactly supported Lipschitz function. It is shown in |Ki] (cf. also 
|HaOnj ) that for every 1 < p < oo there is a constant c p (independent of /) such that 
1 1 DM/ ||p < Cp||.D/||p. However, the methods used in [Ki] and |HaOnj cannot tell us whether 
we actually have c p < 1, that is, whether the maximal operator M has a smoothing effect 
on /. For p = 1, Theorem 2.5 of [AIPe] states that ||DM/||i < ||D/||i, and C\ = 1 is sharp, 
while for p = oo, we have H-DM/Hoo < (y/2 - 1)||D/|U and = {^2 - 1) is best possible, 
by |ACPj . Thus, it is natural to conjecture "by interpolation" that whenever 1 < p < oo, 
the optimal constant c p satisfies c p < 1, and furthermore, linip^oo c p = y/2 — 1. Nevertheless, 
since the operator Df i— > DMf is neither linear nor sublinear, it falls outside the realm 
of currently available interpolation theorems. Unfortunately, the second endpoint for which 
information is available happens to be p = oo, so our stability version of Holder's inequality 
also fails to yield anything new on this question. 



Holder's inequality 



4. The triangle inequality and uniform convexity. 

Like Clarkson's inequalities and Hanner's inequalities, formula (12. 2.1ft can lay claim to being 
an L p generalization of the parallelogram identity. Furthermore, despite its easy proof, the 
refinement of Holder's inequality presented above does have strength: It gives, by sharpening 
Minkowski's inequality, the uniform convexity of L p spaces (at least in the real valued case), 
with the right asymptotic behavior of the modulus of convexity for all p G (l,oo). The 
exact asymptotic behavior was found by O. Hanner (cf. [Haj . or |LiTza2j . p. 63); Clarkson's 
original inequalities (see the Corollary in pg. 403 of [CI]) yield it over the range 2 < p < oo, 
but not for 1 < p < 2. 

The arguments presented here only cover the real valued case, and the complex valued case 
if p > 2. Since only the moduli of functions (and not their signs) play any role in the sizes of 
and ||/||p||<7||g, the same must necessarily happen with the error terms in any refinement 
of Holder's inequality. In particular, this is the case with (I2.2.ip . But for some applications, 
such as a refined triangle inequality, it would be preferable to control the departure from 
maximal size in terms of \f — g\ rather than ||/| — \g\\. We shall show that for real valued 
functions, and for complex valued functions when p > 2, one can assume the comparability 
of ||/ — g\\ p and ||/| — \g\\\ p . But the proof in the complex case when 1 < p < 2 has eluded us. 
A recent, new proof of uniform convexity, relying on the notion of thin slices and which does 
apply to the complex case, can be found in |HaOj (however, there the author is unconcerned 
about the precise behavior of the modulus of convexity). 

The improved Minkowski's inequality given next is obtained from our refinement of Holder's 
inequality by the usual "duality" argument. By the "duality" argument we do not mean 
knowing that the dual of L p is L q , but simply that 

(4.0.1) \\f\\ p = sup ffg, 

{9£Li:\\g\\ q =l} J 

which follows from Holder's inequality together with the trivial observation that equality is 
achieved when g = ^-y— J=^ . Here sign(z) := e 10 for every complex nonzero z = re t6 , and 

sign(0) := 1 (we adopt this convention, rather than the usual sign(0) := 0, since in order to 
multiply quantities without changing sizes it is useful to always have |sign(z)| = 1). As is 
well known, (14.0. ip immediately entails the triangle inequality: 
(4.0.2) 

\\f+h\\ p = sup /(/+%< sup fgi+ sup / hg 2 = \\f\\ P + \\h\\ p . 

{'jeLi:\\g\\ q =l} J { gi eLi:\\ gi \\g=l} J {g 2 eLi:\\g 2 \\ q =l} J 

However, usually this proof appears with the explicit maximizing g written in place of the 
first supremum, and then it proceeds from there. As it turns out, it will be more convenient 
for us to do likewise below. 



10 



J. M. Aldaz 



Theorem 4.1. Let 1 < p < oo. If f,h e LP, \\f\\ p , \\h\\ p > 0, and 1 < p < 2, then 
(4-1.1) 



\\f+h\\p< 



Q 



IP/2 
|p 



p/2 
P 















2 y 


\ ~ q 



\f + h\p/ 2 \h\p/ 2 



\\f + h 



I P/2 
|p 



\\f + h 

while if 2 < p < oo i/ie same inequality holds, but with 1/p replacing 1/q throughout 
Proof. Suppose 1 < p < 2. Then 



IP/2 



\\f + h\ 



if + hr 1 



■\f + h\< 



if+kr 1 



■1/1 + 



\f + h\ 



p-i 



■\h\ 



\\f + h\p-% ,j i -j hij+^-v 1 j mz+^r 1 ,,, 

and the result follows by applying (12.2.11) . If 2 < p < oo argue in the same way and use the 
last part of Theorem 12.21 □ 

Next, we recall some basic facts about the Mazur map ip ryS : IP — > L s . It is defined first 
on the unit sphere by vp r ,s(f) '■= \f\ r ^ s sign /, and then extended to the rest of the space 
by homogeneity (cf. [BeLij . pp. 197-199 for additional information on ipr,s)- The "angle" 



l/l 



p/2 



,19/2 



llallg 



in (12.2. ip is obtained by applying the Mazur maps from the nonnegative 

functions in the unit spheres of LP and L q , into the unit sphere of L 2 . Thus, we have control 
over the distortion, since when r < s, the map ip S)T is Lipschitz on the unit sphere of L s , with 
constant s/r, while its inverse ip r ^ s is Holder with exponent r/s. This is the content of the 
following well known lemma, included here for the reader's convenience. It is a special case 
of Proposition 9.2, pp. 198-199 of |BeLij . cf. also the proof of Theorem 9.1, pg. 198, partially 
sketched below. Note however that in |BeLij the harder, complex valued case is handled, and 
the Holder constant (as opposed to the Holder exponent) is not specified. We will consider 
the Mazur map acting only on nonnegative functions, since that is all we shall use. In this 
easy case we show that the Holder constant is 1. 



Lemma 4.2. Let 1 < r < s < oo, and let f, h > 0. If f, h G V satisfy 
then \\f r / s — h r / s \\ s < \\f — h\\r^ s , while if f,h G L s have norms 
\\f s/r -h s /%<{s/r)\\f-h\\ s . 



\h\\ r = 1, 

= 1, then 



Proof. To prove the Holder assertion, note that by concavity of t a for < a < 1, if a > 6, 
then a a — b a < (a — b) a . Suppose / and h are nonnegative functions of norm 1 in V . Taking 
a = r/s and integrating the pointwise inequality \f r ^ s (x) — h r ^ s (x)\ s < \f(x) — h(x)\ r we get 

iir /s -/i r/ i s <ii/-/i|i; /s . 

We sketch the proof the Lipschitz claim, directing the reader to |BeLij for additional details. 
Let us denote by dip S}r (f )(h) the Gateaux (i.e., the directional) derivative of the Mazur map 
based at the point / and in the direction of h, where the nonnegative functions / and h belong 
the unit sphere of L s . It is enough to show that ||aV>s,r(/)(^)||r < (s/r) r , which follows by 
explicit computation of the directional derivative, and an application of Holder's inequality 
together with ||/||. = \\h\\ s = 1. □ 
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After proving a simple lemma, we use the the properties of the Mazur map to express the 
preceding triangle inequality in terms of the p norm. 

Lemma 4.3. Let x, y, z be vectors in a normed space, and let p e (1, oo). Then \\x — y\\ p < 
2 p ~ l (\\x - z\\ p + \\y-z\\ p ). 

Proof. We may assume that x ^ y. Since y\\ < ||a;— z|| + ||y— z||, writing a := \\x—z\\/\\x— 
y\\ and b := \\y — z\\/\\x— y\\ we have that a + b > 1 and (a p + b p )\\x — y\\ p = \\x — z\\ p + \\y — z\\ p . 
Minimizing a p + ff subject to a + b > 1 and a, b > 0, we obtain a = b = 1/2, from which the 
result follows. □ 

Clarkson gave asymptotic estimates for the modulus of convexity of L p of order 0(e p ) when 
2 < p < oo and 0(e q ) when 1 < p < 2, where e — ||/ — W\ p . The optimal estimate 0(e 2 ) 
when 1 < p < 2 was found by Hanner. It is easy for us to explain this different behavior in 
terms of the Mazur map: When p < 2 the map ip2,p is Lipschitz, and hence the exponent 2 
in the error term from (12.0.21) or (12.2.11) is preserved, while if p > 2, then ^ 2 , p is 2/p-H61der, 
so the exponent 2 changes to p. 

Corollary 4.4. Let 1 < p < oo, and let f,h e L p . If 1 < p < 2, then 

(4.4.1) ||/ + h\\ p < 11/11, + \\h\\ p - min{||/|| p , \\h\\ p } 
while if 2 < p < oo, 

(4.4.2) ||/ + h\\ p < 11/11, + 11% - min{||/|| p , \\h\\ p } 

Proof. The result follows from Theorem 14.11 the previous Lemma, and Lemma 14.21 □ 

Suppose, in order to simplify the corresponding expressions, that ||/|| p = \\h\\ p = 1. A 
drawback of the preceding corollary is that in the right hand side we have || |/| — \h\ \\ rather 
than ||/ — h\\ p , while the left hand side depends on f + h, not on |/| + \h\. This is unavoidable 
since we are deriving the result from the stability version of Holder's inequality (12.2. ip . Thus, 
the case where |||/| — \h\\\ << \\f — h\\ must be handled via a separate argument, which 
somehow we have failed to find when / and h are complex valued and p < 2. The real valued 
case is easy since the only possibility for cancellation is to have opposite signs, and for p > 2 
the complex valued case immediately follows from the convexity of p/2. 

Note that the bound in the next proposition has nothing to do with uniform convexity: It 
holds even when p = 1. In fact, all we are doing is checking the intuitively obvious fact that 
if we want ||/ + h\\ p to be large, the signs of / and h must be very similar, specially if p is 
small. While this ought to be also true in the complex valued I said I have not been 

able to prove it. 

Proposition 4.5. Let 1 < p < oo, let < t < 1, and let f,h 6 LP be real valued functions. 
If \\\f\ ~ \h\\\* <4f- fe|g, then ||/ + fe|| p < ((11/11, + \\h\\ p f -(l-t)\\f- h\\§ 1/p . 



p{p - 1) 



\f\ \h\ 



Ifel 
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Proof. First, we may assume that / > 0, since by the convention sign(0) := 1 (adopted just 
after f)4.0.ip ) given any x we have \f(x) — h(x)\ = \f(x) sign f(x) — h(x) sign f(x)\, and 
likewise for \f(x) + h(x)\. Next, note that if a > and 6 6 1, then \a + b\ p + \a — b\ p = 
\a + \b\\ p + \a — \b\\ p , so writing f(x) = a, h(x) = b, and integrating, we get 



(4.5.1) \\f + h\\ p = \\f + \h\\\ p +\\f-\h 



11/ 



h\\*< 



+ 



\h\\ P r-(i 



mi 



h\\ p 



□ 



Remark 4.6. Note that by Taylor's formula (or by linear approximation at and concavity), 
wc have (1 - x) x l p < 1 — p l x. Applying this inequality to the conclusion of the previous 
proposition when ||/|L = \\h\\ p = 1, we get 



(4.6.1) 



f + h 



< 1 



1 - t 
p2 p 



11/ 



h\\ p 
"-lip- 



Let B be a Banach space. Clarkson's original definition of uniform convexity requires that 
for every < e < 2 there exist a 5(e) > such that if ||/|| = \\h\\ — 1 and ||/ — h\\ > e, then 
11^2^11 ^ 1 — fi( £ ) ( c -f Definition 1., pp. 396-397 of [CI]). The often used and seemingly weaker 
assumption ||/||, \\h\\ < 1 is of course equivalent to ||/|| = \\h\\ = 1, since / and h must have 
norm one in order to maximize ||/ + h\\ p subject to ||/ — h\\ > e (see Lemma 5.1 pg. 381 
of |Daj for a full proof). In the words of [B aCaLij . B is uniformly convex if its unit ball is 
"uniformly free of flat spots" . From the viewpoint of the geometry of B is is often interesting 
to have a good estimate of how 5 depends ones (0, 2]. The following definitions and results 
are taken from [LiTz a2j. specially pg. 63. The modulus of convexity 5b of B is given by 



(4.6.2) 



5 B (e) 



mil 1 



f + h 



\h\ 



h\\f~h\ 



We say that 5b is of power type r if there exists a constant c > such that 5b(s) > ce r . 
For B = L p and 1 < p < 2, 5 B {e) = (p - l)e 2 /8 + o{e 2 ), while for 2 < p < oo, 5 B {e) = 
e p /{p2 p ) + o{e p ). 

The next result shows that in the real valued case, the preceding variants of the triangle 
inequality yield the optimal value of r in the power type estimates. The constants, however, 
are not optimal. But they are not too far away from optimality either. We make an effort to 
obtain "fairly good" constants for the modulus of convexity (and not just good power type 
estimates, which is all one usually needs for applications) since this entails that the constants 
in the original inequality (12.2.11) must also be "fairly good" . 

Theorem 4.7. Let 1 < p < oo. Then B = L P (X,~R) is uniformly convex. Furthermore, its 
modulus of convexity satisfies the following inequalities. If p e (1,2], then for every c > 1 



there exists an e = e(c) such that for all f,h G LP with 



\h\\ p = 1 and \\f 



h\\ p <e, 



(4.7.1) 



MII/-*U> 



pip 



16c 



^\\f-h\\l 
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On the other hand, if 2 < p < oo, then for all f,hEL p with \\f\\ p = \\h\\ p = 1, 



Wf-h 



\p 



(4.7.2) s B (\\f-h\\ ) > ^. 

Proof. Note that for every p G (1, oo) and every £ G (0, 1), if || |/| — \h\ \\ p < t\\f — h\\ p , then 

(l-£) \\f-h\Z 

(4-7.3) S B (\\f-h\\ p )> K - ^ ^ 

by Proposition 14.51 or more precisely, by (14.6. lft . 

We prove gZ2D first. Given t G (0, 1), if |||/| - \h\\\ p > t\\f - h\\ p p , by (14X21) we have the 
bound 

(4-7.4) S B (\\f - h\\ p ) > " J 4p " p . 

Choosing £ so that the lower bounds given by (I4.7.3P and (I4.7.4|) are equal, (I4.7.2P follows. 

With respect to (14.7.11) . observe that for every £ G (0, 1) and ||/ — h\\ sufficiently small 
(depending on £), the bound 

0Pp(p-l) \\f-h\\l 

(4-7.5) 5 B (\\f - h\\ p ) > J^l h 

which follows from ( 14.4.11) when |||/| — \h\\\ p > t\\f — h\\ p , is always smaller than the bound 
given by (fl~7T3D when |||/| - \h\\\ p < t\\f - h\\ p . Writing c = £" 2 / p , gZU) follows by fixing 
e > small enough and taking \\f — h\\ < £. □ 

We have given an asymptotic estimate when 1 < p < 2 in order to be as precise as we can. 
If we are not concerned with good constants, to obtain a statement which does not require £ 
to be small we can just fix any £ (say £ = 2 _1 for definiteness) and take the minimum of the 
quantities given by (14.7.31) and (I4.7.5p . 

Next we consider the case of L P (X, C) spaces, when p > 2. The argument is essentially the 
same as in Proposition 14.51 

Proposition 4.8. Let 2 < p < oo, let < £ < 1, and let f,h G L p be complex valued 
functions. If \\\f\ - \h\ \\ p < t\\f-h\\ p , then \\f+h\\ p < ((\\f\\ P + \\h\\ p ) p - (1 - £)||/ - h\\ p ) 1/p . 

Proof. As before, we may assume that / > 0. Writing h = |/i|e ia , where a = a(h(x)), we 
have that for every x, 

(4.8.1) \f(x) + h(x)\ p + \f(x)-h(x)\* = 

\f 2 (x) + \h(x)\ 2 + 2f(x)\h(x)\cosa(h(x))\ p/2 + \f 2 (x) + \h(x)\ 2 - 2f(x)\h(x)\ cos a(h(x))\ p/2 . 
By the convexity of £ P//2 , 

(4.8.2) \f(x)+h(x)\> + \f(x)-h(x)\* 

< \f 2 (x) + \h(x)\ 2 + 2\h{x)\f{x)\ p ' 2 + \f 2 (x) + \h(x)\ 2 - 2\h(x)\f(x)\ p / 2 
= \f( x ) + \h\(x)\ p +\f(x)-\h\(x)\ p . 
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The rest of the proof is as in Proposition 14.51 □ 

Remark 4.9. From the preceding Proposition and the second part of Corollary 14.41 the 
uniform convexity of the L P (X, C) spaces when p > 2 follows in exactly the same way and 
with the same constants as in Theorem 14.71 so we avoid the repetition. 

Remark 4.10. As we have noted, a disadvantage of the refined triangle inequality given in 
Corollary 14.41 is that the error or stability term depends only on the moduli of the functions 
involved, and not their signs. But this inequality has its advantages also. One of them is 
that it interacts well with other inequalities given here, in the sense that it is easy to obtain 
nontrivial information by combining them. For instance, suppose fi(X) = 1 and < r < s, 
with f,h& L s . Under suitable hypotheses on the variance of \ f + h\ s ^ 2 , we can easily find 
bounds for ||/ + /i|| s in terms of ||/|| r and \\h\\ r , by using Theorem 13 .ll or Corollary l3.2l together 
with Corollary 14.41 Alternatively, we might be interested, say, in bounding ||/ + /i||r i n terms 
of || f\\ s and 1 1 h \\ s . Thus, there are several possibilities to study the behavior of ||/ + h\\ p as p 
changes. 
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